Parallel Management of Large Dynamic Shared Memory Space: A Hierarchical FEM Application
Abstract
This paper examines the memory management issues raised by a parallel, irregular, and dynamic hierarchical application that constantly allocates and deallocates data over an extremely large virtual address space. First, we show that while data locality in the memory caches is necessary, a lack of virtual page locality may greatly affect the obtained performance. Second, we present the fragmentation and contention problems associated with the required parallel dynamic memory allocation. We propose practical solutions and discuss experimental results obtained on a cache-coherent non-uniform memory access (ccNUMA) distributed shared memory SGI Origin2000 machine.
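The abstract's point about virtual page locality can be illustrated with a small address-arithmetic model. This is only a sketch under stated assumptions, not the allocator or the solution described in the paper: the page size, object size, round-robin allocation order, and all function names are hypothetical choices made for illustration. It compares how many distinct pages one thread's objects span when every thread allocates from a single shared heap versus when each thread owns a private contiguous arena.

```python
# Toy model of virtual page locality. No real memory is allocated;
# only start addresses are computed. All constants are assumptions.
PAGE_SIZE = 16 * 1024  # assumed page size in bytes
OBJ_SIZE = 256         # assumed size of each allocated object in bytes


def pages_touched(addresses, page_size=PAGE_SIZE):
    """Return the number of distinct pages containing the start addresses."""
    return len({addr // page_size for addr in addresses})


def shared_bump_alloc(n_threads, n_objs):
    """Model one shared heap with a single allocation pointer.

    Threads allocate in round-robin order (an assumed pattern), so each
    thread's objects are interleaved with those of every other thread.
    Returns one list of start addresses per thread.
    """
    owned = [[] for _ in range(n_threads)]
    next_free = 0
    for _ in range(n_objs):
        for t in range(n_threads):
            owned[t].append(next_free)
            next_free += OBJ_SIZE
    return owned


def per_thread_arena_alloc(n_threads, n_objs):
    """Model one private contiguous arena per thread.

    There is no shared allocation pointer to contend on, and each
    thread's objects occupy adjacent addresses.
    Returns one list of start addresses per thread.
    """
    arena_bytes = n_objs * OBJ_SIZE
    return [
        [t * arena_bytes + i * OBJ_SIZE for i in range(n_objs)]
        for t in range(n_threads)
    ]
```

With 8 threads and 1024 objects per thread, `pages_touched(shared_bump_alloc(8, 1024)[0])` is 128, while `pages_touched(per_thread_arena_alloc(8, 1024)[0])` is 16. In this model the same amount of data is spread over eight times as many pages in the shared case, which is the kind of effect that increases pressure on address translation even when cache-line locality looks acceptable.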
Similar Resources
Hierarchical Work Stealing on Manycore Clusters
Partitioned Global Address Space languages like UPC offer a convenient way of expressing large shared data structures, especially for irregular structures that require asynchronous random access. But the static SPMD parallelism model of UPC does not support divide and conquer parallelism or other forms of dynamic parallelism. We introduce a dynamic tasking library for UPC that provides a simple...
A Dynamic Task Graph Parallelization Approach
The general purpose Finite Element system PERMAS [1] has been parallelized by a dynamic task graph approach capable of supporting shared and distributed parallel computer architectures as well as workstation clusters. The method used to parallelize this large application software package is of high generality and has the capability to parallelize all mathematical operations in a FEM analysis – no...
Parallel Hierarchical Radiosity On Cache-Coherent Multiprocessors
Computing radiosity is a computationally very expensive problem in computer graphics. Recent hierarchical methods have greatly speeded up the computation of first diffuse and now also specular radiosity. We present a parallel algorithm for computing both diffuse and specular radiosity together, and examine its performance in detail on cache-coherent shared address space multiprocessors. We comp...
Hierarchical Coherency Management for Shared Virtual Memory Multicomputers
For many applications of scalable multicomputers with distributed memory it is desirable to provide transparent shared virtual memory. For such applications, the hardware and system software must maintain coherency among the local memories. Most existing coherency schemes for multicomputers manage memory uniformly at a single granularity of fixed size pages or cache blocks, leading to unnecessa...
A Data Management and Communication Layer for Adaptive, Hexahedral FEM
The parallel realization of adaptive finite element methods (FEM) has to deal with several irregular and dynamic algorithmic properties caused by adaptive mesh refinement (AMR). For an implementation on distributed memory machines irregular communication behavior results from dynamically growing data structures and statically unknown communication partners. An efficient parallel implementation ...
Publication date: 2000